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Abstract Turkey flocks have experienced turkey coron- 
aviral enteritis sporadically in the United States since the 
1990s. Twenty-four field isolates of turkey coronavirus 
(TCoV) from multiple states in the United States were 
recovered from 1994 to 2010 to determine the genetic 
relationships among them. The entire spike (S) gene of 
each TCoV isolate was amplified and sequenced. Pairwise 
comparisons were performed using the Clustal W program, 
revealing 90.0 % to 98.4 % sequence identity in the full- 
length S protein, 77.6 % to 96.6 % in the amino terminus 
of the S1 subunit (containing one hypervariable region in 
Sla), and 92.1 % to 99.3 % in the S2 subunit at the 
deduced amino acid sequence level. The conserved motifs, 
including two cleavage recognition sequences of the S 
protein, two heptad repeats, the transmembrane domain, 
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and the Golgi retention signal were identified in all TCoV 
isolates. Phylogenetic analysis based on the full-length S 
gene was used to distinguish North American TCoV iso- 
lates from French TCoV isolates. Among the North 
American TCoV isolates, three distinct genetic groups with 
100 % bootstrap support were observed. North Carolina 
isolates formed group I, Texas isolates formed group I, 
and Minnesota isolates formed Group III. The S genes of 
24 TCoV isolates from the United States remained con- 
served because they contained predominantly synonymous 
substitutions. The findings of the present study suggest 
endemic circulation of distinct TCoV genotypes in differ- 
ent geographic locations. 


Introduction 


Turkey coronaviral enteritis of varying severity, caused by 
turkey coronavirus (TCoV), has been reported in turkey 
flocks from multiple states in the United States since the 
1990s. The major clinical signs of TCoV infection include 
depression, ruffled feathers, diarrhea, decreased body 
weight, and uneven flock growth. The most apparent gross 
lesions are markedly distended intestines with gaseous and 
watery content, particularly in the ileum and ceca. Salient 
histopathologic findings include shortening of the intestinal 
villi, an increase in crypt depth, and widening of intervil- 
lous spaces [1]. When turkeys are infected with TCoV and 
other infectious agents such as astrovirus, small round 
virus, and Escherichia coli (E. coli), they can develop poult 
enteritis-mortality syndrome (PEMS), which causes high 
mortality [2, 3]. Subsequent experimental studies of the 
TCoV isolates VR-911 and TCoV/ON/MG10/08 from 
Canada have shown that TCoV can cause symptoms 
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similar to those caused by PEMS [3, 4]. Therefore, TCoV 
has been suggested to be the major causative pathogen for 
turkey enteritis, and secondary infections caused by other 
opportunistic microorganisms enhance the severity of 
TCoV enteritis and contribute to the development of 
PEMS. Turkey enteritis associated with TCoV infection 
has caused substantial economic losses in Indiana, North 
Carolina, Arkansas, and other states in the United States [5, 
6], as well as in Canada [4], Europe [2, 7], and Brazil [8]. 
Currently, there are no vaccines to prevent the disease, and 
treatment of infected turkeys is often unsuccessful. 

A member of the species Avian coronavirus (CoV) in the 
genus Gammacoronavirus and family Coronaviridae, TCoV 
has a positive single-stranded RNA genome that is approxi- 
mately 27 Kb in size. The major structural proteins of TCoV 
include the spike (S), envelope (E), matrix (M), and nucleo- 
capsid (N) proteins. Comparisons of 3’-end coding regions [9, 
10] as well as the full genomes [11] of TCoV isolates and 
infectious bronchitis virus (IBV) have suggested that TCoV 
arises through recombination in the S gene, because pairwise 
comparisons of S gene sequences have revealed only a 34 % 
similarity between TCoV isolates and IBV strains, whereas 
gene 3, M gene, gene 5, and N gene sequences have over 80 % 
similarity [9, 11, 12]. The S gene sequences of different TCoV 
isolates (93 %-99.7 %) are more conserved than those of 
various IBV strains (67.4 %-94.4 %), which could explain the 
close antigenic relationship of TCoV isolates compared with 
the distant antigenicity of different IBV serotypes [5, 10, 13, 
14]. Investigations during TCoV outbreaks and genomic 
analyses of TCoV isolates have revealed that distinct TCoV 
isolates tend to circulate endemically, and their respective 
sequences group phylogenetically according to their state of 
origin [6, 7, 11]. However, these observations may have been 
biased because of the small number of TCoV sequences that 
were analyzed. In the present study, 24 TCoV isolates were 
recovered from clinical cases submitted to the Indiana Animal 
Disease Diagnostic Laboratory at Purdue University by turkey 
farms in Minnesota, Indiana, North Carolina, Missouri, 
Arkansas, Texas, South Carolina, and Pennsylvania between 
1994 and 2010. The objective of the present study was to 
elucidate the relationship between the genotypes and geo- 
graphic distribution of TCoV isolates from turkey farms in 
multiple states in the United States by using sequence analysis 
and comparing the full-length S gene. 


Materials and methods 
Clinical samples and virus purification 
Twenty-four field isolates of TCoV were recovered from 


clinical cases submitted to the Animal Disease Diagnostic 
Laboratory at Purdue University by turkey farms in 
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Minnesota, Indiana, North Carolina, Missouri, Arkansas, 
Texas, South Carolina, and Pennsylvania between 1994 
and 2010 (Table 1). Field cases of TCoV were confirmed 
by clinical signs, gross lesions, histopathologic findings, 
immunofluorescence antibody (IFA) assay with antiserum 
against TCoV/IN/540/94, electron microscopy, and reverse 
transcription polymerase chain reaction (RT-PCR). All 24 
TCoV isolates were propagated five times in embryonated 
turkey eggs as described previously [1]. In brief, intestines 
from TCoV-infected turkeys were homogenized as 20 % 
suspensions in chilled sterile phosphate-buffered saline and 
clarified by centrifugation at 3000 rpm for 10 minutes at 
4 °C. The supernatant was filtered through a 0.22-um 
membrane filter (Millipore, Bedford, MA, USA). The fil- 
trate was inoculated into the amniotic cavity of 22-day-old 
embryonated turkey eggs. The embryo intestines were 
harvested after 3 days of incubation for virus purification. 
The harvested intestines were homogenized and clarified at 
3000 rpm for 10 minutes at 4 °C. The supernatant was 
layered on top of 30 % and 60 % sucrose and clarified 
using ultracentrifugation in an SW28 rotor at 24,000 rpm 
for 3 hours at 4 °C in an Optima XL-100K ultracentrifuge 
(Beckman Coulter, Fullerton, CA, USA). The interface 
between 30 % and 60 % sucrose was collected and placed 
on top of a continuous 40 %-60 % sucrose gradient and 
clarified by ultracentrifugation at 24,000 rpm for 20 hours 
at 4 °C. A band of buoyant density 1.16—1.24 g/mL (con- 
taining TCoV) was collected and saved at -80 °C as the 
viral stock. 


RNA and cDNA 


The viral RNA was extracted from the purified virus using 
RNApure™ Reagent (GenHunter, Nashville, TN, USA) 
and chloroform, followed by precipitation using cold iso- 
propyl alcohol and ethanol. The extracted RNA was 
reverse transcribed to cDNA using SuperScript™ II 
reverse transcriptase (Invitrogen, Carlsbad, CA, USA) 
according to the manufacturer’s instructions. The first 
reaction was 10 minutes of incubation with the RNA, 
random hexamer primer (100 ng/uL), and 10 mM dNTPs at 
70 °C, followed by 1 minute on ice. The second reaction 
was 5 minutes of incubation with a mixture of 5x first- 
strand buffer, 0.1 M of dithiothreitol (DTT), 1 U of 
SuperScript’ II reverse transcriptase, and 40 U of 
RNaseOUT™ (Invitrogen, Carlsbad, CA, USA) at 25 °C, 
followed by 1 hour of incubation at 50 °C and 15 minutes 
of inactivation at 70 °C. 


PCR amplification and sequencing 


The full-length S gene (approximately 3.9 kb) was ampli- 
fied by PCR using the cDNA of each TCoV isolate with the 
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Table 1 Turkey coronavirus 
(TCoV) isolates and other 
coronaviruses used in the 
molecular analysis of the spike 
(S) gene 


Isolate* 


TCoV/MN/ATCC/76 
TCoV/MN/3 10/96 
TCoV/IN/5 17/94 
TCoV/IN/5 17-Purdue/94 
TCoV/IN/540/94 
TCoV/IN/67 1/04 
TCoV/IN/834/04 
TCoV/IN/287/09 
TCoV/IN/421/10 
TCoV/NC/1020/96 
TCoV/NC/DK9/97 
TCoV/NC/1440/99 
TCoV/NC/183-1/02 
TCoV/NC/625/02 
TCoV/NC/20/09 
TCoV/NC/1741/12 
TCoV/NC/1729/12 
TCoV/NC/1728/12 
TCoV/MO/168/99 
TCoV/MO/2216/99 
TCoV/MO/2580/99 
TCoV/MO/333/06 
TCoV/AR/428/96 
TCoV/AR/1425/96 
TCoV/AR/1811/12 
TCoV/AR/1913/12 
TCoV/AR/1744/12 
TCoV/TX/1038-Purdue/98 
TCoV/TX/1038/98 
TCoV/TX/GL/01 
TCoV/TX/176-2/02 
TCoV/SC/284/96 
TCoV/PA/682/98 
TCoV/PA/640/02 
TCoV/V A/74/03 
TCoV/ON/MG10/08 
TCoV/FR07034 14/07 
TCoV/FR080147c/08 
TCoV/FR080183j/08 
TCoV-0/BR/08 
IBV/Beaudette CK 
IBV/Arkansas DPI 
Guinea fowl CoV/FR/2011 
Beluga whale SW1 CoV 


Year 


1976 
1996 
1994 
1994 
1994 
2004 
2004 
2009 
2010 
1996 
1997 
1999 
2002 
2002 
2009 
2012 
2012 
2012 
1999 
1999 
1999 
2006 
1996 
1996 
2012 
2012 
2012 
1998 
1998 
2001 
2002 
1996 
1998 
2002 
2003 
2008 
2007 
2008 
2008 
2008 
1937 
1981 
2011 
2008 


primers Sup and Sdown3 (Online Resource 1). The mixture 
(64:1, v:v) of Taq (Promega Corp., Madison, WI, USA) 
and Pfu DNA polymerases (Stratagene, La Jolla, CA, USA) 
with proofreading ability was used in a 96-well thermal 


Location Accession number Reference 
Minnesota, USA EU022526 [15] 
Minnesota, USA KF652237 This study 
Indiana, USA GQ427175 [11] 
Indiana, USA KF652222 This study 
Indiana, USA EU022525 [15] 
Indiana, USA KF652220 This study 
Indiana, USA KF652221 This study 
Indiana, USA KF652219 This study 
Indiana, USA KF652218 This study 
North Carolina, USA KF652227 This study 
North Carolina, USA  KF652240 This study 
North Carolina, USA  KF652238 This study 
North Carolina, USA = KF652230 This study 
North Carolina, USA  KF652224 This study 
North Carolina, USA = KF652223 This study 
North Carolina, USA KJ146015 [6] 

North Carolina, USA KJ146017 [6] 

North Carolina, USA KJ146018 [6] 
Missouri, USA KF652229 This study 
Missouri, USA KF652231 This study 
Missouri, USA KF652232 This study 
Missouri, USA KF652239 This study 
Arkansas, USA KF652234 This study 
Arkansas, USA KF652228 This study 
Arkansas, USA KJ146016 [6] 
Arkansas, USA KJ146019 [6] 
Arkansas, USA KJ146020 [6] 

Texas, USA KF652225 This study 
Texas, USA GQ427176 [11] 
Texas, USA GQ427174 [11] 
Texas, USA KF652226 This study 
South Carolina, USA  KF652233 This study 
Pennsylvania, USA KF652236 This study 
Pennsylvania, USA KF652235 This study 
Virginia, USA GQ427173 [11] 
Ontario, Canada EU095850 [4] 

France GQ411201 [7] 

France FN434203 [7] 

France FN545819 [7] 

Brazil HQ853282 Direct submission 
United Kingdom AJ311317 [23] 
Arkansas, USA GQ504720 [24] 
France HF544506 [25] 
Missouri, USA EU111742 [26] 


cycler (GeneAmp, Perkin-Elmer Cetus Corp., Norwalk, 
CT, USA) to maintain the fidelity of the PCR [9]. The PCR 
products were electrophoresed on 1 % agarose gels and 
purified using a Zymoclean’™ Gel DNA Recovery Kit 
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(Zymo, Irvine, CA, USA) for further sequencing. Several 
primers (Online Resource 1) designed to sequence over- 
lapping fragments covering the full length of the S gene of 
TCoV were used to determine the nucleotide sequences of 
the purified PCR products at the Purdue University 
Genomics Core Facility (West Lafayette, IN, USA). In 
addition, the purified PCR product was cloned into the 
PCR-II plasmid vector and used to transform E. coli strain 
TOPIOF’ according to the manufacturer’s instructions 
(Invitrogen, Carlsbad, CA, USA). 


Sequence analysis 


Twenty-four TCoV isolates collected from eight states in 
the United States from 1994 to 2010 were purified and 
sequenced in our laboratory. TCoV/IN/549/94 was ana- 
lyzed in a previous study [15], and the full-length S gene 
sequences of the other 23 TCoV isolates were published for 
the first time in the present study (Table 1). The putative 
peptide cleavage site separating the amino-terminus of the 
S1 subunit from the carboxy] terminus of the S2 subunits as 
well as a second possible peptide cleavage site in the S2 
subunit were detected using the ProP server (http://www. 
cbs.dtu.uk/services/ProP/). The S1 subunit of TCoV was 
further designated as Sla at the amino-terminus (1-204 in 
TCoV/IN/540/94) and S1b at the carboxyl-terminus (205- 
536 in TCoV/IN/540/94) [7]. Figure 1 shows a diagram of 
the S protein of TCoV. The nucleotide and deduced amino 
acid sequence similarities of the S genes of all 24 TCoV 
isolates were analyzed using the Clustal W alignment 
method in MEGA6 [16]. A phylogenetic tree based on full- 
length nucleotide sequences of the S gene was constructed 
using the maximum-likelihood method and the Kimura 
2-parameter model. A phylogenetic tree based on deduced 


Fig. 1 Schematic diagram of 
the spike protein of turkey 
coronavirus. TRS, transcription 
regulatory sequence; SP, signal 
peptide; HVR, hypervariable 
region; HR, heptad repeat; TM, 
transmembrane domain; CP, 
cytoplasmic peptides. Numbers 
in parentheses indicate the 
amino acid position from the 
start codon of the spike protein 
from TCoV/IN/540/94 
(EU022525) 


S2 (537-1203) 


t 


2™4 S cleavage site 
(698/699) 
NQGR/STIEDLLF 
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amino acid sequences of the Sla subunit, containing a 
hypervariable region (HVR), was constructed using the 
neighbor-joining method and the Jones—Taylor—Thornton 
model. A codon-based Z-test of positive selection for S 
gene sequences of various TCoV isolates was conducted to 
analyze the differences in the number of nonsynonymous 
(dN) and synonymous (dS) substitutions per site by using 
the Nei—Gojobori method [17] in MEGAG6. The variance of 
both trees and codon-based Z-test were validated using 
1000 bootstrap replicates. 


Nucleotide sequence accession numbers 


The S sequences of TCoV isolates reported in the present 
study were submitted to the GenBank databse, and their 
accession numbers ranged from KF652218 to KF652240. 
The accession numbers of other CoVs used for phyloge- 
netic analysis are also listed in Table 1. 


Results 


Genetic analysis of the spike sequences of turkey 
coronavirus isolates 


The sizes of the S genes of TCoV isolates reported in the 
present study ranged from 3609 to 3630 nucleotides. All 24 
TCoV isolates exhibited similar S protein sequences 
(Fig. 1). The consensus transcription-regulating sequence 
(TRS), CTGAACAA, was identified 52 nucleotides 
upstream of the start codon of the S protein. Several con- 
served motifs and one HVR were found in all TCoV iso- 
lates, and their sequences are listed for comparison in 
Table 2. The consensus motif RXRR/X (X is any amino 


$1 (1-536) 


RXRR/S > S2 


S1b (205-536) | $2 (537- 


Neutralizing 
epitopes 
(416-519) 












14 aa insertion 
(1000-1014) 
XXLYXYSDVIDKIN 









Genotyping of TCoV isolates 


Table 2 Conserved motifs in the spike (S) protein cleavage sites and the sequences of the hypervariable region (HVR) of turkey coronavirus 


(TCoV) isolates 


Isolate 


TCoV/ATCC/MN/76 
TCoV/310/MN/96 
TCoV/IN/5 17/94 
TCoV/IN/5 17-Purdue/94 
TCoV/IN/540/94 
TCoV/IN/67 1/04 
TCoV/IN/834/04 
TCoV/IN/287/09 
TCoV/IN/421/10 
TCoV/NC/1020/96 
TCoV/NC/DK9/97 
TCoV/NC/1440/99 
TCoV/NC/183-1/02 
TCoV/NC/625/02 
TCoV/NC/20/09 
TCoV/NC/1728/12 
TCoV/NC/1729/12 
TCoV/NC/1741/12 
TCoV/MO/168/99 
TCoV/MO/2216/99 
TCoV/MO/2580/99 
TCoV/MO/333/06 
TCoV/AK/428/96 
TCoV/AK/1425/96 
TCoV/AK/1744/12 
TCoV/AK/1913/12 
TCoV/TX/1038-Purdue/98 
TCoV/TX/1038/98 
TCoV/TX/GL/01 
TCoV/TX/176-2/02 
TCoV/SC/284/96 
TCoV/PA/682/98 
TCoV/PA/640/02 
TCoV/V A/74/03 
TCoV/ON/MG10/08 
TCoV-0 ~ 10/BR/08 
TCoV/FR07034 14/07 
TCoV/FR080147c/08 
TCoV/FR080183j/08 


“ The positions of HVR, S1/S2 cleavage, and S2 second predicted cleavage site in parentheses were relative to the start codon of the spike 


S full length (aa) 


1203 
1203 
1226 
1205 
1203 
1209 
1209 
1209 
1208 
1208 
1209 
1207 
1208 
1208 
1209 
NA? 
NA” 
NA” 
1207 
1206 
1206 
1206 
1203 
1204 
NA” 
NA” 
1202 
1223 
1225 
1207 
1207 
1203 
1208 
1226 
1226 


1198 
1198 
1198 


protein of TCoV/IN/540/94 (EU022525) 


> NA: data not available 


acid, R is arginine, and slash [/] indicates the cleavage 
position) was found at the cleavage site for the S1 and S2 
subunits in 22 TCoV isolates analyzed in the present study 
and the French TCoV isolates, except for the isolates 
TCoV/MN/310/96 and TCoV/PA/682/98, which had the 


Hypervariable region (HVR) (126-134)* S1/S2 cleavage (536)* S2 2nd site (698)* 


LSMGN——ST RRRR ATS NQGR/S 
LSMGN——ST RRRR ATS NQGR/S 
LATNVGKVTPP-ND RKRR/S NQGR/S 
LGT—KVGD-HD RERR/S NQGR/S 
LGT—KVGD-HD RERR/S NQGR/S 
IDTNVGK VTPP-HE RKRR/S NQGR/S 
VYTNVGKVTPP-NE RKRR/S NQGR/S 
LITNVGD—HD RKRR/S NQGR/S 
IETNVGK-TPP-ND RRRR/S NQGR/S 
LVTNAGNVKP-DD RKRR/S NQGR/S 
LHTNVGKVTPP-ND RKRR/S NQGR/S 
VSTNAGKVR—_ND RERR/S NQGR/S 
IGTNVGKKPP-NE RKRR/S NQGR/S 
VATNVGKSAPWAND RRRR/S NQGR/S 
INTNAGKITLP-NE RKRR/S NQGR/S 
MDTNVGK-PN-ND RRRR/S NA” 

IDTNVGK-PN-ND RRRR/S NA” 

IDTNVGK-PN-NH RRRR/S NAP 

VGTNAGKQ—KE RKRR/S NQGR/S 
VSTNAGQKP—NE RKRR/S NQGG/S 
IGTNAGKV—ND RKRR/S NQGR/S 
IDTNAGKVGP-KD RKRR/S NQGR/S 
LKVAN-SH— RKRR/S NQGR/S 
IETNVGK-SH—NE RKRR/S NQGR/S 
VGTNAGKVSAPNE RKRR/S NAD 

IDTNVGK-S-YD RKRR/S NA” 

INTKG——ND RMRR/S NQGR/S 
IGTNADKKS—ND RTRR/S NQGR/S 
IEINAGEVT—ND RTRR/S NQGR/S 
IGTNADKKS—ND RMRR/S NQGR/S 
LVTNVGNVKT-AD RKRR/S NQGR/S 
LSMGN——ST RRRW ATS NQGR/S 
IYTNVGKLTP-ND RKRR/S NQGR/S 
LATNVGKVTPPND RKRR/S NQGR/S 
LATNVGKVTPPND RKRR/S NQGR/S 
LATNVGKVTPPND RKRR/S NQGR/S 
SIGNFVLNS RSRR/S PQGR/S 
SIGNFVLNS RSRR/S PQGR/S 
SIGNFVLNS RSRR/S PQGR/S 


amino acid sequence ATS followed by the cleavage site, 
similar to TCoV/MN/ATCC/76. A conserved sequence, 
NQGR/S, resembling the furin-dependent cleavage site in 
IBV [18] was identified in the S2 subunit in all TCoV 
isolates except the isolate TCoV/MO/2216/99, in which the 
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critical arginine was mutated to glycine and the second 
probable protein cleavage site was lost. Rather than 
NQGR/S, French TCoV isolates had the PQGR/S sequence 
as the conserved cleavage motif in the S2 subunit. Only 
one HVR, spanning amino acid positions 126 to 134 
(TCoV/IN/540/94) was found in the TCoV isolates, rather 
than the three HVRs identified in IBV [7]. Two 14-amino- 
acid insertions in heptad repeats (HR1 and HR2), the 
consensus motif (YIKWPWYVWL) in the transmembrane 
domain, and the late Golgi retention signal (YYTTF) for S 
protein were also observed in all 24 TCoV isolates. Among 
the 45 amino acid residues of the neutralizing-epitope- 
containing S fragment in the S1 subunit identified in a 
previous study [19], 33 consensus residues were observed 
among the 24 TCoV isolates (Online Resource 2). 


Comparison of the spike sequences of turkey 
coronavirus isolates 


A pairwise comparison of the deduced amino acid 
sequences of the 24 TCoV isolates showed that the 
sequence identity ranged from 90.0 % to 98.4 % for the 
full-length S protein, 77.6 % to 96.6 % for the Sla subunit 
containing HVR, and 92.1 % to 99.3 % for the S2 subunit 
(Online Resource 3). No positive selection for the S gene 
was observed among the TCoV isolates (Online Resource 
4). The values of ds were greater than those of dy in all 
comparisons except three pairs of TCoV isolates (TCoV/ 
IN/540/94 and IN/517-Purdue/94, IN/671/04 and IN/834/ 
04, MO/2216/99 and MO/168/99), which had similar ds 
and dy values and shared high sequence identity in the S 
gene, exceeding 95 %. 


Phylogenetic analysis of the spike gene 


Phylogenetic trees based on the full-length S nucleotide 
sequences (Fig. 2A) and Sla amino acid sequences con- 
taining HVRs (Fig. 2B) of different CoVs of the genus 
Gammacoronavirus were generated. As shown in Fig- 
ures 2A and 2B, the IBV strains were separated from 
TCoV isolates, and North American TCoV isolates were 
separated from French TCoV isolates. Three genetic 
groups, referred to as groups I, II, and III, were observed in 
North American TCoV isolates (Fig. 2A). Group I included 
all North Carolina isolates except TCoV/NC/1020/96, 
Indiana isolates from 2004 and 2010, Missouri isolates 
from 1999, and isolates from Pennsylvania, Virginia, and 
Canada. Group II contained all Texas isolates, Indiana 
isolates from 1994 and 2009, one Missouri isolate each 
from 1999 and 2006, and North Carolina, South Carolina, 
and Arkansas isolates from 1996. Group III was composed 
of two Minnesota isolates that were isolated 20 years apart 
and TCoV/PA/682/98, and they had sequence identities 
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higher than 98.5 %. Because of the high degree of varia- 
tion, most phylogenetic groupings based on the Sla 
deduced amino acid sequences did not have a bootstrap 
value over 50 % (Fig. 2B). Nevertheless, the Texas TCoV 
isolates of group II and all three TCoV isolates of group III 
shown in the phylogenetic tree based on the full-length S 
nucleotide sequences still clustered according to their Sla 
amino acid sequences containing their HVR. 


Discussion 


Turkey coronavirus isolates from different geographic 
areas in the United States have been shown to be anti- 
genically related to one another [5]. In the present study, an 
antiserum against the isolate TCoV/IN/540/94 reacted with 
all 24 TCoV isolates from TCoV-infected turkeys and 
embryos by IFA assay (data not shown). The close anti- 
genicity among TCoV isolates was associated with the high 
similarity of the S gene sequences, which ranged from 
90.4 % to 99.4 %. The S genes of TCoV isolates were 
conserved compared with the diverse S genes among var- 
ious IBV strains, which range from 67.4 % to 94 %, 
resulting in the existence of many serotypes of IBV [10]. 
The emergence of new IBV serotypes has been postulated 
to involve the recombination of S genes of vaccine strains 
of IBV in the field [20, 21]. In a previous study, different 
serotypes of TCoV/VA/73/03, TX/1038/98, and IN/517/94 
were identified by using a neutralization test in conjunction 
with real-time RT-PCR despite the high level of amino acid 
sequence identity (96 % to 98 %) among these TCoV 
isolates [11]. Additional studies are necessary to clarify the 
antigenic relationships among the various TCoV isolates 
and serotypes. 

In the present study, similar to previous findings with 
IBV strains, most of the variations in the S protein 
sequences among TCoV isolates were observed in the 
amino-terminal half. Along the alignment of these S pro- 
tein sequences, the region of sequence with the most 
variation was between residues 126 and 134 of TCoV/IN/ 
540/94 from the start codon of the S protein. Various 
deletions occurred in this region in different TCoV isolates. 
This region is in the vicinity of HVR II (residues 117 to 
131) of the IBV S protein. HVR I (residues 56 to 69), II, 
and III (residues 250 to 365) of the IBV S protein are 
associated with three neutralizing epitopes. The sequences 
of these regions could be used for differential diagnosis of 
IBV serotypes [13, 22]. By contrast, similar regions of high 
variation corresponding to HVR I or III of IBV were not 
detected among the TCoV isolates examined. These dif- 
ferences illustrate why the S proteins of IBV strains are 
more diverse than those of TCoV isolates. Similar phylo- 
genetic trees were constructed using the full-length S and 
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Fig. 2 Consensus bootstrap phylogenetic tree based on the full- 
length spike (S) gene nucleotide sequences (A) and the Sla amino 
acid sequences (B) of turkey coronavirus (TCoV) isolates, infectious 
bronchitis virus (IBV) strains, guinea fowl (Gf) CoV, and beluga 
whale SW1 CoV (accession number in parentheses). The nucleotide 


Sla proteins. Thus, genotyping TCoV field isolates based 
on the Sla sequence containing HVR rather than the whole 
S1 gene or full-length S gene is more practical. 

The observation that TCoV isolates originating from 
the same state were closely clustered together in the 
phylogenetic tree suggested endemic circulation of dis- 
tinct TCoV genotypes in various geographic locations. 
Endemic circulation of distinct TCoV genotypes in 
France and North American are recognized because 
French TCoV isolates share only 60 % amino acid 
sequence identity in the S protein with North American 
TCoV isolates [7]. Distinct sources of recombination 
promoting the emergence of TCoV in North America 
and Europe has been suggested [7, 11]. The groupings of 
North Carolina, Texas, and Minnesota TCoV isolates 
also support the theory of endemic TCoV genotypes. The 
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sequence tree was constructed by the maximum-likelihood method 
and the Kimura 2-parameter model, and the amino acid sequence tree 
was constructed by the neighbor-joining method and the Jones- 
Taylor-Thonton model in MEGA 6. The bootstrap values were 
calculated from 1000 trees 


TCoV isolates from the outbreaks in Arkansas and North 
Carolina in 2012 also clustered geographically [6] and 
could be placed phylogenetically in group I in the pre- 
sent study. The 99.3 % amino acid sequence identity of 
the S proteins of two TCoV isolates recovered 20 years 
apart in Minnesota (TCoV/MN/ATCC/76 and TCoV/MN/ 
310/96) implied that the TCoV isolate MN/ATCC/76 
remained endemic and that no substantial genetic chan- 
ges occurred over two decades. Conservation of TCoV 
isolates is also shown in the result that no positive 
selection of the S protein was found among the 24 
TCoV isolates. Because Indiana isolates from 2004 and 
2010 clustered in group I with most North Carolina 
isolates and Indiana isolates from 1994 and 2009 clus- 
tered in group II with Texas isolates, it is most likely 
that the turkey sources were the same for the turkey 
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farms in North Carolina and Indiana in 2004 and 2010, 
whereas the turkey sources were the same for Texas and 
Indiana in 1994 and 2009. 

In conclusion, the relationship between TCoV genotypes 
and the geographic distribution of TCoV presented in the 
present study provides crucial information for the moni- 
toring and control of diseases associated with TCoV 
infection in the United States. 
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